Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 822 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 196.1 KiB |
| Average record size in memory | 244.2 B |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 10 |
Dataset
| Description | Este es un analisis preeliminar para comprender de mejor forma los datos de nuestro dataset |
|---|---|
| Author | Kenneth David Leonel Triana , Juan Jose Naranjo, Alejandro Mora |
| URL | https://github.com/kennethLeonel/Monografia-calidad-del-aire-valle-de-aburra |
anio has constant value "2024" | Constant |
festivo is highly imbalanced (72.5%) | Imbalance |
p1 is highly imbalanced (52.0%) | Imbalance |
codigoserial is uniformly distributed | Uniform |
dia_semana is uniformly distributed | Uniform |
estacion is uniformly distributed | Uniform |
presion has 242 (29.4%) zeros | Zeros |
Reproduction
| Analysis started | 2024-10-16 23:49:41.410582 |
|---|---|
| Analysis finished | 2024-10-16 23:50:11.715107 |
| Duration | 30.3 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
anio
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.4 KiB |
| 2024 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3288 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024 |
|---|---|
| 2nd row | 2024 |
| 3rd row | 2024 |
| 4th row | 2024 |
| 5th row | 2024 |
Common Values
| Value | Count | Frequency (%) |
| 2024 | 822 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2024 | 822 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1644 | |
| 0 | 822 | |
| 4 | 822 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3288 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1644 | |
| 0 | 822 | |
| 4 | 822 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3288 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1644 | |
| 0 | 822 | |
| 4 | 822 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3288 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1644 | |
| 0 | 822 | |
| 4 | 822 |
mes
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0072993 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.5816641 |
|---|---|
| Coefficient of variation (CV) | 0.51558014 |
| Kurtosis | -1.2270774 |
| Mean | 5.0072993 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.010631573 |
| Sum | 4116 |
| Variance | 6.6649893 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 93 | |
| 3 | 93 | |
| 5 | 93 | |
| 7 | 93 | |
| 8 | 93 | |
| 4 | 90 | |
| 6 | 90 | |
| 9 | 90 | |
| 2 | 87 |
| Value | Count | Frequency (%) |
| 1 | 93 | |
| 2 | 87 | |
| 3 | 93 | |
| 4 | 90 | |
| 5 | 93 | |
| 6 | 90 | |
| 7 | 93 | |
| 8 | 93 | |
| 9 | 90 |
| Value | Count | Frequency (%) |
| 9 | 90 | |
| 8 | 93 | |
| 7 | 93 | |
| 6 | 90 | |
| 5 | 93 | |
| 4 | 90 | |
| 3 | 93 | |
| 2 | 87 | |
| 1 | 93 |
dia
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.729927 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.8023391 |
|---|---|
| Coefficient of variation (CV) | 0.55959186 |
| Kurtosis | -1.1958321 |
| Mean | 15.729927 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.0050555266 |
| Sum | 12930 |
| Variance | 77.481174 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 27 | 3.3% |
| 2 | 27 | 3.3% |
| 29 | 27 | 3.3% |
| 28 | 27 | 3.3% |
| 27 | 27 | 3.3% |
| 26 | 27 | 3.3% |
| 25 | 27 | 3.3% |
| 24 | 27 | 3.3% |
| 23 | 27 | 3.3% |
| 22 | 27 | 3.3% |
| Other values (21) | 552 |
| Value | Count | Frequency (%) |
| 1 | 27 | |
| 2 | 27 | |
| 3 | 27 | |
| 4 | 27 | |
| 5 | 27 | |
| 6 | 27 | |
| 7 | 27 | |
| 8 | 27 | |
| 9 | 27 | |
| 10 | 27 |
| Value | Count | Frequency (%) |
| 31 | 15 | |
| 30 | 24 | |
| 29 | 27 | |
| 28 | 27 | |
| 27 | 27 | |
| 26 | 27 | |
| 25 | 27 | |
| 24 | 27 | |
| 23 | 27 | |
| 22 | 27 |
pm25
Real number (ℝ)
| Distinct | 608 |
|---|---|
| Distinct (%) | 74.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 214.97324 |
| Minimum | -9999 |
|---|---|
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4 |
| Negative (%) | 0.5% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -9999 |
|---|---|
| 5-th percentile | 10.53648 |
| Q1 | 14.391738 |
| median | 18.408425 |
| Q3 | 24.935037 |
| 95-th percentile | 37.611642 |
| Maximum | 99999 |
| Range | 109998 |
| Interquartile range (IQR) | 10.5433 |
Descriptive statistics
| Standard deviation | 4980.0982 |
|---|---|
| Coefficient of variation (CV) | 23.166131 |
| Kurtosis | 392.58198 |
| Mean | 214.97324 |
| Median Absolute Deviation (MAD) | 4.747375 |
| Skewness | 19.601092 |
| Sum | 176708.01 |
| Variance | 24801378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 14 | 1.7% |
| 19 | 11 | 1.3% |
| 16.5 | 11 | 1.3% |
| 18.5 | 10 | 1.2% |
| 15 | 9 | 1.1% |
| 18 | 9 | 1.1% |
| 13.5 | 8 | 1.0% |
| 20 | 8 | 1.0% |
| 17.5 | 7 | 0.9% |
| 14 | 7 | 0.9% |
| Other values (598) | 728 |
| Value | Count | Frequency (%) |
| -9999 | 4 | |
| 1 | 3 | |
| 5.36807 | 1 | 0.1% |
| 5.56998 | 1 | 0.1% |
| 6.12773 | 1 | 0.1% |
| 6.662345 | 1 | 0.1% |
| 6.685655 | 1 | 0.1% |
| 7.309475 | 1 | 0.1% |
| 7.5 | 2 | |
| 7.85481 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 2 | |
| 49 | 1 | |
| 46.81495 | 1 | |
| 46.5 | 1 | |
| 46.2742 | 1 | |
| 45.8667 | 1 | |
| 45.5 | 1 | |
| 45 | 1 | |
| 44.8498 | 1 | |
| 44.8023 | 1 |
codigoserial
Categorical
UNIFORM 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.8 KiB |
| 28 | |
|---|---|
| 69 | |
| 86 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1644 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 28 |
|---|---|
| 2nd row | 28 |
| 3rd row | 28 |
| 4th row | 28 |
| 5th row | 28 |
Common Values
| Value | Count | Frequency (%) |
| 28 | 274 | |
| 69 | 274 | |
| 86 | 274 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 28 | 274 | |
| 69 | 274 | |
| 86 | 274 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 548 | |
| 6 | 548 | |
| 2 | 274 | |
| 9 | 274 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1644 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8 | 548 | |
| 6 | 548 | |
| 2 | 274 | |
| 9 | 274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1644 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8 | 548 | |
| 6 | 548 | |
| 2 | 274 | |
| 9 | 274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1644 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8 | 548 | |
| 6 | 548 | |
| 2 | 274 | |
| 9 | 274 |
dia_semana
Categorical
UNIFORM 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.4 KiB |
| Lunes | |
|---|---|
| Martes | |
| Miercoles | |
| Jueves | |
| Viernes | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.5656934 |
| Min length | 5 |
Characters and Unicode
| Total characters | 5397 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lunes |
|---|---|
| 2nd row | Martes |
| 3rd row | Miercoles |
| 4th row | Jueves |
| 5th row | Viernes |
Common Values
| Value | Count | Frequency (%) |
| Lunes | 120 | |
| Martes | 117 | |
| Miercoles | 117 | |
| Jueves | 117 | |
| Viernes | 117 | |
| Sabado | 117 | |
| Domingo | 117 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| lunes | 120 | |
| martes | 117 | |
| miercoles | 117 | |
| jueves | 117 | |
| viernes | 117 | |
| sabado | 117 | |
| domingo | 117 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 939 | |
| s | 588 | |
| o | 468 | 8.7% |
| n | 354 | 6.6% |
| i | 351 | 6.5% |
| a | 351 | 6.5% |
| r | 351 | 6.5% |
| u | 237 | 4.4% |
| M | 234 | 4.3% |
| L | 120 | 2.2% |
| Other values (12) | 1404 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5397 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 939 | |
| s | 588 | |
| o | 468 | 8.7% |
| n | 354 | 6.6% |
| i | 351 | 6.5% |
| a | 351 | 6.5% |
| r | 351 | 6.5% |
| u | 237 | 4.4% |
| M | 234 | 4.3% |
| L | 120 | 2.2% |
| Other values (12) | 1404 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5397 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 939 | |
| s | 588 | |
| o | 468 | 8.7% |
| n | 354 | 6.6% |
| i | 351 | 6.5% |
| a | 351 | 6.5% |
| r | 351 | 6.5% |
| u | 237 | 4.4% |
| M | 234 | 4.3% |
| L | 120 | 2.2% |
| Other values (12) | 1404 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5397 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 939 | |
| s | 588 | |
| o | 468 | 8.7% |
| n | 354 | 6.6% |
| i | 351 | 6.5% |
| a | 351 | 6.5% |
| r | 351 | 6.5% |
| u | 237 | 4.4% |
| M | 234 | 4.3% |
| L | 120 | 2.2% |
| Other values (12) | 1404 |
estacion
Categorical
UNIFORM 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.8 KiB |
| Estacion Itagui | |
|---|---|
| Estacion Caldas | |
| Estacion Aranjuez |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 15.666667 |
| Min length | 15 |
Characters and Unicode
| Total characters | 12878 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Estacion Itagui |
|---|---|
| 2nd row | Estacion Itagui |
| 3rd row | Estacion Itagui |
| 4th row | Estacion Itagui |
| 5th row | Estacion Itagui |
Common Values
| Value | Count | Frequency (%) |
| Estacion Itagui | 274 | |
| Estacion Caldas | 274 | |
| Estacion Aranjuez | 274 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| estacion | 822 | |
| itagui | 274 | 16.7% |
| caldas | 274 | 16.7% |
| aranjuez | 274 | 16.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1918 | |
| t | 1096 | 8.5% |
| i | 1096 | 8.5% |
| n | 1096 | 8.5% |
| s | 1096 | 8.5% |
| E | 822 | 6.4% |
| c | 822 | 6.4% |
| o | 822 | 6.4% |
| 822 | 6.4% | |
| u | 548 | 4.3% |
| Other values (10) | 2740 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12878 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1918 | |
| t | 1096 | 8.5% |
| i | 1096 | 8.5% |
| n | 1096 | 8.5% |
| s | 1096 | 8.5% |
| E | 822 | 6.4% |
| c | 822 | 6.4% |
| o | 822 | 6.4% |
| 822 | 6.4% | |
| u | 548 | 4.3% |
| Other values (10) | 2740 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12878 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1918 | |
| t | 1096 | 8.5% |
| i | 1096 | 8.5% |
| n | 1096 | 8.5% |
| s | 1096 | 8.5% |
| E | 822 | 6.4% |
| c | 822 | 6.4% |
| o | 822 | 6.4% |
| 822 | 6.4% | |
| u | 548 | 4.3% |
| Other values (10) | 2740 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12878 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1918 | |
| t | 1096 | 8.5% |
| i | 1096 | 8.5% |
| n | 1096 | 8.5% |
| s | 1096 | 8.5% |
| E | 822 | 6.4% |
| c | 822 | 6.4% |
| o | 822 | 6.4% |
| 822 | 6.4% | |
| u | 548 | 4.3% |
| Other values (10) | 2740 |
festivo
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0 | |
|---|---|
| 1 | 39 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 822 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 783 | |
| 1 | 39 | 4.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 783 | |
| 1 | 39 | 4.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 783 | |
| 1 | 39 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 822 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 783 | |
| 1 | 39 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 822 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 783 | |
| 1 | 39 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 822 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 783 | |
| 1 | 39 | 4.7% |
temperatura
Real number (ℝ)
| Distinct | 219 |
|---|---|
| Distinct (%) | 26.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -74.148966 |
| Minimum | -999 |
|---|---|
| Maximum | 25.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 19.5 |
| median | 21.3 |
| Q3 | 22.9 |
| 95-th percentile | 24.195 |
| Maximum | 25.5 |
| Range | 1024.5 |
| Interquartile range (IQR) | 3.4 |
Descriptive statistics
| Standard deviation | 297.51698 |
|---|---|
| Coefficient of variation (CV) | -4.0124225 |
| Kurtosis | 5.8206594 |
| Mean | -74.148966 |
| Median Absolute Deviation (MAD) | 1.6975 |
| Skewness | -2.7939302 |
| Sum | -60950.45 |
| Variance | 88516.354 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 21 | 20 | 2.4% |
| 23.5 | 16 | 1.9% |
| 22.4 | 16 | 1.9% |
| 23.4 | 15 | 1.8% |
| 23 | 15 | 1.8% |
| 20.1 | 14 | 1.7% |
| 22.5 | 14 | 1.7% |
| 22.9 | 14 | 1.7% |
| 21.9 | 14 | 1.7% |
| Other values (209) | 607 |
| Value | Count | Frequency (%) |
| -999 | 77 | |
| 16.1 | 1 | 0.1% |
| 16.5 | 1 | 0.1% |
| 16.7 | 1 | 0.1% |
| 16.8 | 3 | 0.4% |
| 16.9 | 2 | 0.2% |
| 17 | 5 | 0.6% |
| 17.1 | 1 | 0.1% |
| 17.2 | 4 | 0.5% |
| 17.3 | 4 | 0.5% |
| Value | Count | Frequency (%) |
| 25.5 | 1 | 0.1% |
| 25.375 | 1 | 0.1% |
| 25.1 | 1 | 0.1% |
| 25.09 | 1 | 0.1% |
| 25 | 1 | 0.1% |
| 24.995 | 1 | 0.1% |
| 24.9 | 1 | 0.1% |
| 24.85 | 1 | 0.1% |
| 24.8 | 4 | |
| 24.799999 | 1 | 0.1% |
humedad
Real number (ℝ)
| Distinct | 426 |
|---|---|
| Distinct (%) | 51.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -25.815931 |
| Minimum | -999 |
|---|---|
| Maximum | 91.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 65.8125 |
| median | 74.05 |
| Q3 | 81.15 |
| 95-th percentile | 86.1 |
| Maximum | 91.8 |
| Range | 1090.8 |
| Interquartile range (IQR) | 15.3375 |
Descriptive statistics
| Standard deviation | 313.16084 |
|---|---|
| Coefficient of variation (CV) | -12.130527 |
| Kurtosis | 5.8102623 |
| Mean | -25.815931 |
| Median Absolute Deviation (MAD) | 7.55 |
| Skewness | -2.7907922 |
| Sum | -21220.695 |
| Variance | 98069.709 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 81 | 14 | 1.7% |
| 78 | 12 | 1.5% |
| 73 | 12 | 1.5% |
| 82 | 12 | 1.5% |
| 76 | 12 | 1.5% |
| 85 | 11 | 1.3% |
| 75 | 10 | 1.2% |
| 84 | 10 | 1.2% |
| 80 | 10 | 1.2% |
| Other values (416) | 642 |
| Value | Count | Frequency (%) |
| -999 | 77 | |
| 50.2 | 1 | 0.1% |
| 51.25 | 1 | 0.1% |
| 52.15 | 1 | 0.1% |
| 52.15 | 1 | 0.1% |
| 52.5 | 2 | 0.2% |
| 52.9 | 1 | 0.1% |
| 54 | 1 | 0.1% |
| 54.2 | 1 | 0.1% |
| 54.95 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 91.8 | 1 | 0.1% |
| 91.43 | 1 | 0.1% |
| 91 | 1 | 0.1% |
| 90.9 | 1 | 0.1% |
| 89.5 | 1 | 0.1% |
| 89 | 3 | |
| 88.8 | 1 | 0.1% |
| 88.68 | 1 | 0.1% |
| 88.55 | 1 | 0.1% |
| 88.5 | 1 | 0.1% |
presion
Real number (ℝ)
ZEROS 
| Distinct | 95 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 420.48345 |
| Minimum | -999 |
|---|---|
| Maximum | 854.6 |
| Zeros | 242 |
| Zeros (%) | 29.4% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 0 |
| median | 826 |
| Q3 | 851.5 |
| 95-th percentile | 853 |
| Maximum | 854.6 |
| Range | 1853.6 |
| Interquartile range (IQR) | 851.5 |
Descriptive statistics
| Standard deviation | 590.81487 |
|---|---|
| Coefficient of variation (CV) | 1.4050847 |
| Kurtosis | 0.37110377 |
| Mean | 420.48345 |
| Median Absolute Deviation (MAD) | 26.5 |
| Skewness | -1.1897845 |
| Sum | 345637.4 |
| Variance | 349062.21 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 242 | |
| -999 | 77 | 9.4% |
| 825.8 | 17 | 2.1% |
| 826.7 | 17 | 2.1% |
| 852.4 | 16 | 1.9% |
| 852.3 | 15 | 1.8% |
| 852.1 | 15 | 1.8% |
| 826.5 | 14 | 1.7% |
| 852.9 | 13 | 1.6% |
| 852 | 13 | 1.6% |
| Other values (85) | 383 |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 0 | 242 | |
| 823.5 | 1 | 0.1% |
| 824.1 | 1 | 0.1% |
| 824.2 | 1 | 0.1% |
| 824.3 | 2 | 0.2% |
| 824.4 | 1 | 0.1% |
| 824.6 | 3 | 0.4% |
| 824.7 | 4 | 0.5% |
| 824.8 | 5 | 0.6% |
| Value | Count | Frequency (%) |
| 854.6 | 1 | 0.1% |
| 854.1 | 2 | |
| 854 | 1 | 0.1% |
| 853.9 | 4 | |
| 853.85 | 1 | 0.1% |
| 853.8 | 2 | |
| 853.7 | 2 | |
| 853.6 | 2 | |
| 853.55 | 1 | 0.1% |
| 853.5 | 2 |
p1
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.8 KiB |
| 0.0 | |
|---|---|
| -999.0 |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.310219 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2721 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 737 | |
| -999.0 | 85 | 10.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 737 | |
| 999.0 | 85 | 10.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1559 | |
| . | 822 | |
| 9 | 255 | 9.4% |
| - | 85 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2721 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1559 | |
| . | 822 | |
| 9 | 255 | 9.4% |
| - | 85 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2721 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1559 | |
| . | 822 | |
| 9 | 255 | 9.4% |
| - | 85 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2721 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1559 | |
| . | 822 | |
| 9 | 255 | 9.4% |
| - | 85 | 3.1% |
velocidad_prom
Real number (ℝ)
| Distinct | 163 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -92.201259 |
| Minimum | -999 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 1.11125 |
| median | 1.415 |
| Q3 | 1.7875 |
| 95-th percentile | 2.28 |
| Maximum | 3.5 |
| Range | 1002.5 |
| Interquartile range (IQR) | 0.67625 |
Descriptive statistics
| Standard deviation | 291.70438 |
|---|---|
| Coefficient of variation (CV) | -3.1637787 |
| Kurtosis | 5.8212824 |
| Mean | -92.201259 |
| Median Absolute Deviation (MAD) | 0.315 |
| Skewness | -2.7941186 |
| Sum | -75789.435 |
| Variance | 85091.443 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 1.4 | 58 | 7.1% |
| 1.5 | 58 | 7.1% |
| 1.2 | 58 | 7.1% |
| 1.6 | 47 | 5.7% |
| 1.3 | 47 | 5.7% |
| 1.8 | 33 | 4.0% |
| 1.7 | 31 | 3.8% |
| 1.1 | 25 | 3.0% |
| 1 | 22 | 2.7% |
| Other values (153) | 366 |
| Value | Count | Frequency (%) |
| -999 | 77 | |
| 0.1 | 2 | 0.2% |
| 0.2 | 1 | 0.1% |
| 0.5 | 1 | 0.1% |
| 0.6 | 9 | 1.1% |
| 0.7 | 19 | 2.3% |
| 0.8 | 18 | 2.2% |
| 0.9 | 15 | 1.8% |
| 0.93 | 1 | 0.1% |
| 0.99 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 3.5 | 1 | |
| 3.3 | 1 | |
| 3.1 | 2 | |
| 3 | 2 | |
| 2.9 | 1 | |
| 2.7 | 2 | |
| 2.61 | 1 | |
| 2.6 | 2 | |
| 2.59 | 1 | |
| 2.5 | 1 |
velocidad_max
Real number (ℝ)
| Distinct | 57 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -91.358942 |
| Minimum | -999 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 1.8 |
| median | 2.3 |
| Q3 | 2.9 |
| 95-th percentile | 3.5 |
| Maximum | 5 |
| Range | 1004 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 291.97579 |
|---|---|
| Coefficient of variation (CV) | -3.1959191 |
| Kurtosis | 5.8212299 |
| Mean | -91.358942 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | -2.7941027 |
| Sum | -75097.05 |
| Variance | 85249.86 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 1.9 | 47 | 5.7% |
| 2.9 | 39 | 4.7% |
| 2.2 | 37 | 4.5% |
| 2 | 37 | 4.5% |
| 1.8 | 37 | 4.5% |
| 2.3 | 37 | 4.5% |
| 1.7 | 36 | 4.4% |
| 3.1 | 35 | 4.3% |
| 2.8 | 35 | 4.3% |
| Other values (47) | 405 |
| Value | Count | Frequency (%) |
| -999 | 77 | |
| 0.3 | 1 | 0.1% |
| 0.4 | 1 | 0.1% |
| 0.55 | 1 | 0.1% |
| 0.9 | 1 | 0.1% |
| 1 | 3 | 0.4% |
| 1.1 | 7 | 0.9% |
| 1.2 | 8 | 1.0% |
| 1.3 | 11 | 1.3% |
| 1.4 | 13 | 1.6% |
| Value | Count | Frequency (%) |
| 5 | 1 | 0.1% |
| 4.7 | 1 | 0.1% |
| 4.5 | 2 | 0.2% |
| 4.4 | 5 | |
| 4.2 | 1 | 0.1% |
| 4.1 | 2 | 0.2% |
| 4 | 4 | |
| 3.9 | 3 | |
| 3.85 | 1 | 0.1% |
| 3.8 | 3 |
direccion_prom
Real number (ℝ)
| Distinct | 336 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.837591 |
| Minimum | -999 |
|---|---|
| Maximum | 338 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 74.125 |
| median | 130 |
| Q3 | 167 |
| 95-th percentile | 272 |
| Maximum | 338 |
| Range | 1337 |
| Interquartile range (IQR) | 92.875 |
Descriptive statistics
| Standard deviation | 338.53405 |
|---|---|
| Coefficient of variation (CV) | 9.7174929 |
| Kurtosis | 5.246014 |
| Mean | 34.837591 |
| Median Absolute Deviation (MAD) | 46 |
| Skewness | -2.6116487 |
| Sum | 28636.5 |
| Variance | 114605.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 135 | 14 | 1.7% |
| 124 | 13 | 1.6% |
| 138 | 12 | 1.5% |
| 134 | 12 | 1.5% |
| 126 | 12 | 1.5% |
| 129 | 12 | 1.5% |
| 131 | 10 | 1.2% |
| 125 | 10 | 1.2% |
| 130 | 9 | 1.1% |
| Other values (326) | 641 |
| Value | Count | Frequency (%) |
| -999 | 77 | |
| 0.5 | 1 | 0.1% |
| 4.5 | 1 | 0.1% |
| 29 | 1 | 0.1% |
| 30 | 2 | 0.2% |
| 32 | 1 | 0.1% |
| 32.5 | 1 | 0.1% |
| 33 | 2 | 0.2% |
| 34 | 2 | 0.2% |
| 35 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 338 | 1 | 0.1% |
| 335.5 | 1 | 0.1% |
| 327 | 3 | |
| 326 | 3 | |
| 322.5 | 1 | 0.1% |
| 321.5 | 1 | 0.1% |
| 320 | 1 | 0.1% |
| 318 | 1 | 0.1% |
| 315 | 1 | 0.1% |
| 313.5 | 1 | 0.1% |
direccion_max
Real number (ℝ)
| Distinct | 302 |
|---|---|
| Distinct (%) | 36.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.058394 |
| Minimum | -999 |
|---|---|
| Maximum | 333 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 77 |
| Negative (%) | 9.4% |
| Memory size | 12.8 KiB |
Quantile statistics
| Minimum | -999 |
|---|---|
| 5-th percentile | -999 |
| Q1 | 71.25 |
| median | 141 |
| Q3 | 187 |
| 95-th percentile | 264 |
| Maximum | 333 |
| Range | 1332 |
| Interquartile range (IQR) | 115.75 |
Descriptive statistics
| Standard deviation | 340.14222 |
|---|---|
| Coefficient of variation (CV) | 8.4911596 |
| Kurtosis | 5.2506013 |
| Mean | 40.058394 |
| Median Absolute Deviation (MAD) | 58 |
| Skewness | -2.6174331 |
| Sum | 32928 |
| Variance | 115696.73 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -999 | 77 | 9.4% |
| 182 | 14 | 1.7% |
| 199 | 12 | 1.5% |
| 150 | 12 | 1.5% |
| 141 | 12 | 1.5% |
| 175 | 11 | 1.3% |
| 180 | 10 | 1.2% |
| 68 | 9 | 1.1% |
| 174 | 9 | 1.1% |
| 45 | 8 | 1.0% |
| Other values (292) | 648 |
| Value | Count | Frequency (%) |
| -999 | 77 | |
| 0.5 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| 35 | 1 | 0.1% |
| 39 | 2 | 0.2% |
| 40 | 3 | 0.4% |
| 42 | 4 | 0.5% |
| 43 | 2 | 0.2% |
| 44 | 3 | 0.4% |
| 45 | 8 | 1.0% |
| Value | Count | Frequency (%) |
| 333 | 1 | |
| 325.5 | 1 | |
| 319 | 1 | |
| 313 | 2 | |
| 302 | 1 | |
| 299 | 2 | |
| 298.5 | 2 | |
| 297.5 | 1 | |
| 292 | 2 | |
| 289 | 1 |
| anio | mes | dia | pm25 | codigoserial | dia_semana | estacion | festivo | temperatura | humedad | presion | p1 | velocidad_prom | velocidad_max | direccion_prom | direccion_max | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2024 | 1 | 1 | 18.5 | 28 | Lunes | Estacion Itagui | 1 | 21.980000 | 81.000000 | 0.0 | 0.0 | 1.760 | 2.50 | 107.5 | 84.0 |
| 1 | 2024 | 1 | 2 | 11.0 | 28 | Martes | Estacion Itagui | 0 | 21.580000 | 83.000000 | 0.0 | 0.0 | 2.080 | 2.90 | 55.0 | 56.0 |
| 2 | 2024 | 1 | 3 | 13.0 | 28 | Miercoles | Estacion Itagui | 0 | 21.400000 | 76.210000 | 0.0 | 0.0 | 1.870 | 2.70 | 114.0 | 93.0 |
| 3 | 2024 | 1 | 4 | 21.0 | 28 | Jueves | Estacion Itagui | 0 | 21.804999 | 79.000000 | 0.0 | 0.0 | 1.760 | 2.55 | 164.0 | 151.0 |
| 4 | 2024 | 1 | 5 | 19.0 | 28 | Viernes | Estacion Itagui | 0 | 20.945000 | 79.320000 | 0.0 | 0.0 | 1.940 | 2.80 | 167.5 | 175.0 |
| 5 | 2024 | 1 | 6 | 16.0 | 28 | Sabado | Estacion Itagui | 0 | 22.100000 | 74.090000 | 0.0 | 0.0 | 1.905 | 2.80 | 170.0 | 182.5 |
| 6 | 2024 | 1 | 7 | 11.0 | 28 | Domingo | Estacion Itagui | 0 | 22.210000 | 74.884998 | 0.0 | 0.0 | 2.490 | 3.50 | 62.0 | 60.0 |
| 7 | 2024 | 1 | 8 | 18.0 | 28 | Lunes | Estacion Itagui | 1 | 22.500000 | 77.230000 | 0.0 | 0.0 | 1.860 | 2.60 | 166.0 | 182.0 |
| 8 | 2024 | 1 | 9 | 21.0 | 28 | Martes | Estacion Itagui | 0 | 22.900000 | 80.000000 | 0.0 | 0.0 | 1.790 | 2.60 | 164.0 | 184.0 |
| 9 | 2024 | 1 | 10 | 23.0 | 28 | Miercoles | Estacion Itagui | 0 | 22.500000 | 81.000000 | 0.0 | 0.0 | 2.025 | 2.90 | 34.0 | 43.0 |
| anio | mes | dia | pm25 | codigoserial | dia_semana | estacion | festivo | temperatura | humedad | presion | p1 | velocidad_prom | velocidad_max | direccion_prom | direccion_max | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 264 | 2024 | 9 | 21 | 19.44620 | 86 | Sabado | Estacion Aranjuez | 0 | 23.30 | 62.10 | 851.0 | 0.0 | 1.1 | 1.9 | 212.0 | 210.0 |
| 265 | 2024 | 9 | 22 | 14.29740 | 86 | Domingo | Estacion Aranjuez | 0 | 20.60 | 80.90 | 851.6 | 0.0 | 1.0 | 1.7 | 228.0 | 226.0 |
| 266 | 2024 | 9 | 23 | 14.01040 | 86 | Lunes | Estacion Aranjuez | 0 | 21.30 | 78.35 | 852.4 | 0.0 | 1.2 | 2.3 | 46.0 | 73.0 |
| 267 | 2024 | 9 | 24 | 15.56025 | 86 | Martes | Estacion Aranjuez | 0 | 23.40 | 68.40 | 851.8 | 0.0 | 1.4 | 2.8 | 49.0 | 62.0 |
| 268 | 2024 | 9 | 25 | 16.60635 | 86 | Miercoles | Estacion Aranjuez | 0 | 22.80 | 70.95 | 851.9 | 0.0 | 1.2 | 2.2 | 199.0 | 194.0 |
| 269 | 2024 | 9 | 26 | 11.44115 | 86 | Jueves | Estacion Aranjuez | 0 | 19.90 | 80.10 | 852.9 | 0.0 | 0.9 | 1.7 | 247.0 | 238.0 |
| 270 | 2024 | 9 | 27 | 27.49150 | 86 | Viernes | Estacion Aranjuez | 0 | 19.85 | 79.20 | 852.4 | 0.0 | 0.6 | 1.1 | 273.0 | 270.0 |
| 271 | 2024 | 9 | 28 | 26.79975 | 86 | Sabado | Estacion Aranjuez | 0 | 19.60 | 81.70 | 852.5 | 0.0 | 0.8 | 1.5 | 198.0 | 194.0 |
| 272 | 2024 | 9 | 29 | 15.18520 | 86 | Domingo | Estacion Aranjuez | 0 | 18.70 | 87.30 | 853.3 | 0.0 | 0.6 | 1.1 | 266.5 | 256.5 |
| 273 | 2024 | 9 | 30 | 19.79815 | 86 | Lunes | Estacion Aranjuez | 0 | 19.80 | 81.70 | 852.9 | 0.0 | 0.7 | 1.3 | 258.5 | 246.0 |